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This paper shows an application of Bloom and Esik’s iteration algebras to model graph data in a graph 
database query language. About twenty years ago, Buneman et al. developed a graph database query 
language UnQL on the top of a functional meta-language UnCAL for describing and manipulating 
graphs. Recently, the functional programming community has shown renewed interest in UnCAL, 
because it provides an efficient graph transformation language which is useful for various applica¬ 
tions, such as bidirectional computation. However, no mathematical semantics of UnQL/UnCAL 
graphs has been developed. In this paper, we give an equational axiomatisation and algebraic seman¬ 
tics of UnCAL graphs. The main result of this paper is to prove that completeness of our equational 
axioms for UnCAL for the original bisimulation of UnCAL graphs via iteration algebras. Another 
benefit of algebraic semantics is a clean characterisation of structural recursion on graphs using free 
iteration algebra. 


1 Introduction 

Graph database is used as a back-end of various web and net services, and therefore it is one of the 
important software systems in the Internet society. About twenty years ago, Buneman et al. lH |7] |8l 
developed a graph database query language UnQL (Unstructured data Query Language) on top of a 
functional meta-language UnCAL (Unstructured Calculus) for describing and manipulating graph data. 
The term “unstructured” is used to refer to unstructured or semi-structured data, i.e., data having no 
assumed format in a database (in contrast to relational database). Recently, the functional programming 
community found a new application area of UnCAL in so-called bidirectional transformations on graph 
data, because it provides an efficient graph transformation language. The theory and practice of UnCAL 
have been extended and refined in various directions (e.g. HE ns El D), which has increased the 
importance of UnCAL. 

In this paper, we give a more conceptual understanding of UnCAL using semantics of type theory 
and fixed points. We give an equational axiomatisation and algebraic semantics of UnCAL graphs. The 
main result of this paper is to prove completeness of our equational axioms for UnCAL for the original 
bisimulation of UnCAL graphs via iteration algebras. Another benefit of algebraic semantics is a clean 
characterisation of the computation mechanism of UnCAL called “structural recursion on graphs” using 
free iteration algebra. 

UnCAL Overview. We begin by introducing UnCAL. UnCAL deals with graphs in a graph database. 
Hence, it is better to start with viewing how concrete semi-structured data is processed in UnCAL. 
Consider the semi-structured data sd below which is taken from fSl- 
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It contains information about country, e.g. geography, people, government, etc. 
It is depicted as a tree above, 
in which edges and leaves are 
labelled. Using UnCAL’s term 
language for describing graphs 
(and trees), this is defined by 
sd shown at right. Then we can 
define funcfions in UnCAL fo 
process dafa. For example, a 
function fhaf refrieves all efhnic groups in fhe graph can be defined simply by 

sfun fl(L:T) = if L = ethnicGroup then (result:!) else fl(T) 

The keyword sfun denofes a funclion definition by structural recursion on graphs, which is fhe compu- 
fafional mechanism of UnCAL. Executing if, we can certainly exfracf: 

fl(sd) {result:"Celtic", result:"Portuguese", result:"Italian"} 

The nofafion {■■■:■■■,■■■} is a part 
of fhe UnCAL’s ferm language for rep- 
resenfing graphs. If consisfs of mark¬ 
ers X, labelled edges f:t, vertical com- 
posifions sot, horizonfal composifions 
{s,t), ofher horizonfal compositions 
sot merging roofs, forming cycles 
cycle(f), consfanfs {},(), and defini- 
fions {x < t). These term consfrucfions 
have underlying graph fheorefic mean¬ 
ing shown af fh righf. Namely, fhese 
are officially defined as operafions on 
fhe ordinary represenfafions of graphs: 

(vertices sef, edges sef, leaves, roofs)- 
fuples {y,E,{yi,...ym},{x\,...,Xn\), buf 

we do nol use fhe graph fheorefic definitions of fhese operafions in fhis paper. 
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Figure 1: Graph theoretic definitions of constructors (8) 

Slightly changed notation. Correspondence between the original and this paper’s: 
&y = y, @ = o, © = ->, {-:=-) = -< 


sd < country:{name:"Luxembourg", 
geography:{coordinates:{long:"49 45N", lat:"6 lOE"}, 
area:{total:2586, land:2586}}, 
people:{population:425017, 

ethnicGroup:"Celtic", 
ethnicGroup:"Portuguese", 
ethnicGroup:"Italian"}, 

government:{executive:{chiefOfState:{name:"Jean",..}}}} 
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UnCAL deals with graphs 


&L 




modulo bisimulation (i.e. not only modulo graph isomorphism). 
An UnCAL graph is directed and have (possibly multiple) root(s) 
written & (or multiple • • • Xn) and leaves (written yi • • -ym), and 
with the roots and leaves drawn pictorially at the top and bottom, 
respectively. The symbols x,yi,y 2 ,& in the figures and terms are 
called markers, which are the names of nodes in a graph and are 
used for references for cycles. Also, they are used as port names 
to connect two graphs. A dotted line labelled s is called an £-edge, 
which is a “virtual” edge connecting two nodes directly. This is 
achieved by identifying graphs by extended bisimulation, which 
ignores e-edges suitably in UnCAL. The UnCAL graph G shown 
at the left is an example. This is extended bisimilar to a graph that 
reduces all e-edges. Using UnCAL’s language, G is represented 
Figure 2: Graph G and bisimilar one as the following term to 

to = a:({b:x} u {c:x}) o cycle(x < d:({p:yi} u {q:y 2 } u {r:x))). 



UnCAL’s structural recursive function works also on cycle. For example, define anofher function 

sfun f2(L:T) = a:£2(1) 
fhaf replaces every edge wifh a. As expecfed, 

f2(tG) a:({a:x} u {a:x)) o cycle(x < a:({a:yi} u {a:y 2 } u {a:x})) 


where all labels are changed fo a. 

Anofher characferisfic role of bisimulafion is fhaf if idenfifies expansion of cycles. For example, a 
ferm cycle(& < a: &) corresponds fo fhe graph shown below af fhe leffmosl. If is bisimilar fo fhe righf 
ones, especially fhe infinitely expanded graph shown af fhe righfmosf, which has no cycle. 



These are in ferm nofafion: 

cycle(& < a:&) 



a:cycle(& < a:&) ~ a:a:cycle(& < a:&) 


Problems. There have been no algebraic laws thaf esfablish fhe above expansion of cycle. Namely, 
fhese are merely bisimilar, and nol a consequence of any algebraic law. But obviously, we expect that it 
should be a consequence of the algebraic law affixed point property of cycle. 

In the original and subsequent formulation of UnCAL |l8l[T7]|T8l|T], there are complications of this 
kind. The relationship between terms and graphs in UnCAL is not a one-to-one correspondence. No term 
notation exits for £-edges and infinite graphs (generated by the cycle construct), thus the rightmost infinite 
graphs of the above expansion cannot be expressed in syntax. But such an infinite graph is allowed as 
a possible graph in the original formulation of UnCAL. Consequently, instead of terms, one must use 
graphs and graph theoretic reasoning with care of bisimulation to reason about UnCAL. Therefore, a 
property in UnCAL could not be established only using induction on terms. That fact sometime makes 
some proofs about UnCAL quite complicated. 
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Because UnCAL graphs are identified by bisimulation, it is necessary to use a procedure or algorithm 
to check the bisimilarity as in the cycle example above. Listing some typical valid equations for the 
bisimulation can be a shortcut mUll, but it was only sound and not complete for bisimulation. 

Hence, we give an algebraic and type-theoretic formulation of UnCAL by giving equational axioms 
of UnCAL graphs. In this paper, we prove completeness of our proposed axioms using iteration algebra 
Il4l . Thus we have a complete syntactic axiomatisations of the equality on UnQL/UnCAL graphs, as a 
set of axioms capturing the original bisimulation, without touching graphs, e-edges, and the notion of 
bisimulation explicitly. We prove it by connecting it with the algebraic axiomatisations of hi simulation 

Hill. 

How to model UnCAL and structural recursion. The first idea to understand UnCAL is to interpret 
it as a categorical structure. We can regard edges as morphisms (of the opposite directions), the vertical 
composition o as the composition of arrows, and cycle as a fixpoint operator in a suitable category. 
Thus the target categorical structure should have a notion of fixpoint, which has been studied in iteration 
theories of Bloom and Esik IJj. In particular, iteration categories ifTOl are suitable, which are traced 
cartesian categories l(20]l (monoidal version is used in Hasegawa’s modelling of cyclic sharing theories 
ifT^fTSl l additionally satisfying the commutative identities axiom [31 (see also |[^ Section 2 for a useful 
overview around this). 

We also need to model UnCAL’s computational mechanism: “structural recursion on graphs”. The 
general form of the definition of structural recursive function is 

sfun FQf :t') - e (★) 

where e can involve F{t). The graph algorithm in f8| provide a transformation of graphs that produces 
some computed graphs using the definition (★). It becomes a function F satisfying the equations ([8| 
Prop. 3): 


F{yi) = yi F{(x< t)) = {x< F(t)) F{ fu )^e 

F(()) = () F{ sot ) = F{s)oF(t) F{ sot ) = F{s)oF{t) •••(><) (1) 

F{{}) = {} F{ {s,t) )^{F{s),F{t)) F( cycle(0 ) -cycle(F(0) •••(><) 

when e does not depencii] on 1. This is understandable naturally as the example £2 recurses structurally 
the term tc- Combining the above categorical viewpoint, F can be understood as a functor that preserves 
cycle and products (thus a traced cartesian functor). A categorical semantics of UnCAL can be given 
along this idea, which will be reported elsewhere. This idea works for simple cases of structural recursion 
such as £2. 

However, there is a critical mismatch between the above categorical view and UnCAL’s structural 
recursion of more involved cases. Buneman et al. mentioned a condition that the above nine equations 
hold only when e does not depend on t in (★). Two equations marked (x) do not hold in general if e does 
depend on t (other seven equations do hold). Crucially, £1 is already this case, where T appears as not 
of the form £1(T). The following another example shows why (x) do not hold: the structural recursive 
function aa? tests whether the argument contains “a: a 

sfun a?(L:T) = if L=a then true:{) else {) 
sfun aa?(L:T) = if L=a then a?CT) else aa?(T) 

The definition of aa? does depend on T at the “then”-clause. Then we have the inequalities: 

'Here “e depends on f” means that e contains t other than the form F(t). 
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aa?( (a:&)oCa:{))) = aa?( a:a:{} ) = true:{} {} = {}o{} = aa?(a:&) o aa?(a:{}) 

aa?( cycle(a:&) ) = aa?( a:a:cycle(a:&) ) = true:{} {} = cycle({)) = cycle(aa?(a:&)) 

This means that F does not preserve cycle in general, and even is not functorial, thus the categorical 
view seems not helpful to understand this pattern of recursion. 

In this paper, we consider algebraic semantics of UnCAL using the notion of iteration E-algebras 
im |T2l in ^ It solve the problem mentioned above, i.e. we derive the structural recursion even when 
the case that e depends on t within the algebraic semantics. 

Organisation. This paper is organised as follows. We first give a framework of equational theory for 
UnCAL graphs by reformulating UnCAL graph data in a type theoretic manner in Section |2] We then 
give algebraic semantics of UnCAL using iteration E-algebras in Section [3] We prove completeness of 
our axioms for UnCAL graphs for bisimulation in Section 13.31 We further derive structural recursion 
on UnCAL graphs in Section 13.51 Finally, in Section 13.61 we show several examples how structural 
recursive functions on graphs are modeled. 

2 UnCAL and its Equational Theory 

We give a framework of equational theory for UnCAL graphs. We reformulate UnCAL graph data in a 
type theoretic manner. We do not employ the graph theoretic and operational concepts (such as e-edges, 
bisimulation, and the graph theoretic definitions in Fig. [U. Instead, we give an algebraic axiomatisation 
of UnCAL graphs following the tradition of categorical type theory [9']. The syntax in this paper is 
slightly modified from the original presentation [S’] to reflect the categorical idea, which may be more 
readable for the reader familiar with categorical type theory. 

2.1 Syntax 

Markers and contexts. We assume an infinite set of symbols called markers, denoted by typically 
x,y,z,--- One can understand markers as variables in a type theory. The marker denoted by & is called 
the default marker, which is just a default choice of a marker having no special property. Let L be a 
set of labels. A label f is a symbol (e.g. a,b, c,... in Fig. |2]l. A context, denoted by {{x\,X 2 ,- ■ ■)), is a 
sequence of pairwise distinct markers . We typically use X,Y,Z,... for contexts. We use (()) for the empty 
contexts, X, Y for the concatenation, and |A| for its length. We may use the vector notation x for sequence 
x\,...,Xr,. The outermost bracket (( )) of a context may be omitted. We may use the abbreviations for 
the empty context 0 = (()). Note that the concatenation may need suitable renaming to satisfy pairwise 
distinctness of markers. 

Raw terms. 

t ::= yy \ F.t \ sot \ {s,t} \ cycle^(0 | {}y | ()y | A | (x < t) 

We assume several conventions to simplify the presentation of theory. We often omit subscripts or 
superscripts such as Y when they are unimportant or inferable. We identify {{s, t), u) with {s,{t, «)); 
thus we will freely omit parentheses as (fi,..., L). A constant A express a branch in a tree, and we call 
the symbol A a man, because it is similar to the shape of a kanji or Chinese character meaning a man, 
which is originated from the figure of a man having two legs (and the top is a head). 
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(Nil) 


y h{; & 


(Emp) 


Y^Oy :«» 


(Man) 


yi,y 2 I- A«yi, 3 , 2 » 


(Com) 


F hi :Z X ht :Y 
X r sot : Z 


F h i : Ai F h f : ^2 

(Pair) 


(Label) - 

(Cyc) 


’eL F h f ; & 
Y bfu : & 

FA h f :A 


Fh(s,f) :Ai,A 2 ' F hcycle^(0 :A 

Figure 3: Typing rules 


(Mark) 

(Def) 


y^<(yi,...,yn» 
F h ytY : & 

F h f ; & 

F h (x < f) : X 


Abbreviations. We use the following abbreviations. 


{i}u{f} = Ao{s,t) sxt = (s o TTi, t o 7T2) Ax = (idxJdx) 

^2 = y{{x,y)) - ^l({xi))'>^"''Y~Xn((^x„)) 

Inheriting the eonvention of -), we also identify {sxt)xu with sx{tx u), thus we omit parentheses 
ast\X...xtn- 

2.2 Typed syntax 

For contexts A, F, we inductively define a judgment relation F h t : A of terms by the typing rules in Fig. 
|3] We call a marker/ree in t when it occurs in t other than the left hand-side of a definition (x < s). In 
a judgment, free markers in t are always taken from F. Thus F is a variable context (which we call the 
source context) in ordinary type theory, and A is the roots (which we call the target context or type). For 
example, the term tc in 3I]is well-typed yi,y 2 i- tc : which corresponds a graph in Fig. |2j where 

the marker &is the name of the root. When t is well-typed by the typing rules, we call t a (well-typed 
UnCAL) term. We identify t of type & with (& < t). 


Definition 2.1 (Substitution) Let F = ((yi • • • ,yk)), W be contexts such that |F| < |W| and F can be em¬ 
bedded into W in an order-preserving manner, and Y' is the subsequence of W deleting all of F (NB. 
|VF| = |F| -I- |F'|, Y' is possibly empty). Suppose W H f : A, Z h 5,- : ((y,)) (1 < / < k). Then a 

substitution Z, F' h t [y i-> : A is inductively defined as follows. 


Fr [y 1-^ - 

X [y ^ 

{}y [y ^ 

Or [y- 
{eu)\f^^ A 


Si 

X (if X in F') 
{ \z+Y' 

Oz+Y’ 

f:{t\yY^^) 


(ft of 2 ) [y 1 -^ 
ih ,t2)\y^^ 
cycle(t) [y 
(x < f) [y Ij 


y2 1-^ S2] = A«j,i,y2» o(i'l , S2) 


t\o{t2 [y ij) 

((fi [y Jl), (t 2 [y 1-^ ‘^)) 
cycle(f [y 5j) 

(x < t [y^ ^) 


Note fhaf t [y denofes a mefa-level subsfilufion operation, nol an explicif subslifulion. 


2.3 Equational theory 

For terms F h 5 : A and Y \-1 : A, an (UnCAL) equation is of the form Y \- s = t : A. Hereafter, for 
simplicity, we often omit the source A and target F contexts, and simply write 5 = f for an equation, but 
even such an abbreviated form, we assume that it has implicitly suitable source and target contexts and 
is of the above judgemental form. 
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Composition 

(subl) t <>(y < 

for y \-1 : X 

Parameterised fixpoint 

(fix) cycle (0 - io(iclF,cycle( 0 ) 

(Bekic) cycle((l, 5 )) = <;T 2 ,cycle( 5 ))o 
(idy, cycle(i o (idy^, cycleC^)))) 

(naty) cycle(f) 05 = cycle(f <>{sx id^)) 
(natx) cycle(5 <>t) = so cyc\e{t o (idy x 5)) 
(Cl) cycle((i o (idx x pi),..., f o (idx x pj)) 

= AmOcycle(fo(idxX A„,)) 


Deleting trivial cycle 

(c2) cycle(A) - id 

Commutative monoid 

(unite A) A o ({ }o X id) = id 

(assocA) A o (id X A) - A o (A X id) 

(comA) A oc = A 

Degenerated bialgebra 
(compa) A o A = ( A X A) o (id X c X id) o (A X A) 
(degen) A o A = id 


Figure 4: Axioms AxGr for UnCAL graphs 

Fig. m shows our proposed axioms AxGr to characterise UnCAL graphs. These axioms are chosen 
to soundly and completely represent the original bisimulation of graphs by the equality of this logic. 
Actually, it is sound: for every axiom s = t, s and t are bisimilar. But completeness is not clear only from 
the axioms. We will show it in ^ 

The axiom (subl) is similar to the /3-reduction in the T-calculus, which induces the axioms for carte¬ 
sian product (cf. the derived theory below). The cartesian structure provides a canonical commutative 
comonoid with comultiplication A. 

Two terms are paired with a common root by {5') u (t) = A o (5, t). The commutative monoid axioms 
states that this pairing {-) u {-) can be parentheses free in nested case. The degenerate bialgebra axioms 
state the compatibility between the commutative monoid and comonoid structures. The degenerated 
bialgebra is suitable to model directed acyclic graphs (cf. lfT4l §4.5), where it is stated within a PROP 
ED. The monoid multiplication A expresses a branch in a tree, while the comultiplication A expresses 
a sharing. Commutativity expresses that there is no order between the branches of a node, cf. (commuu) 
in the derived theory below, and degeneration expresses that the branches of a node form a set (not a 
sequence), cf. (degen’). 

Parameterised fixpoint axioms axiomatise a fixpoint operator. They (minus (Cl)) are known as the 
axioms for Conway operators of Bloom and Esik [T], which ensures that all equalities that holds in 
epo semantics do hold. It is also arisen in work independently of Hyland and Hasegawa lfT5]l . who 
established a connection with the notion of traced cartesian categories ll20l . There are equalities that 
Conway operators do not satisfy, e.g. cycle(t) = cycle(t o t) does not hold only by the Conway ax¬ 
ioms. The axiom (Cl) fills fhis gap, which corresponds to the commutative identities of Bloom and 
Esik 0]. This form is taken from i25ll and adopted to the UnCAE setting, where A^ - (id&, • • •, idg,), 
T = ((yi,...,ym)), & I-: T, X+Y\-t : Y \-: T such that p,-= (^,iwhere each is 

one of F h tt,- : & for / = 1,...,m. The axiom (c2) (and derived (cl) below) have been taken as necessary 
ones for completeness for bisimulation used in several axiomatisations, e.g. |[2^ l5l[T^. 

The equational logic EL-UnCAL for UnCAE is a logic to deduce formally proved equations, called 
(UnCAL) theorems. The equational logic is almost the same as ordinary one for algebraic terms. The 
inference rule of the logic consists of reflexivity, symmetricity, transitivity, congruence rules for all 
constructors, with the following axiom and the substitution rules. 


(Ax) 


(F h s = f : A) 6 £ 
F i-i = f :A 


(Sub) 


W 1 — f \X Z Si - s'. :yi (I < i < k) 
Z + Y' ht\y\-^ s\^t'\y^ :X 


The set of all theorems deduced from the axioms AxGr is called a (UnCAL) theory. 
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Derived theory. 

The following are 

; formally 

derivable from the axioms, thus are theorems. 

(tmnl) t - 

= ()y for all T 1 - i 

^ : «» 

(dpair) 

{t\,t2)^S = {tiOS,t20s) 

(fst) ni 

o{s,t} - s 


(fsi) 

{711,712) = id 

(snd) 7T2<>{s,t) - t 


(SP) {Tl\Ot ,Tl20t) = t 

(bmul) 

()&>^()& - 

()& ^ A 

(bcomul) 

^ ^ { }o “ ({ lo A { }o) 

(unitRA) 

Ao(idx{)o) - 

id 

(bunit) 

()&o{}o - id 

(cl) 

cycle(id) 

do 

(commu) 

{x}u{t} ^ {t}u{x} 

(unRo) 

f^id 

t 

(unitu) 

{{}}u{t} - t^MuU)} 

(unLo) 

id Of 

t 

(assocu) 

{{x} u {t}} u {m} ^ {x} u {{t} u {m}} 

(assoco) 

{sot)ou - 

so{tou) 

(degen’) 

{t} u {t} = t 


Because of the first three lines, UnCAL has the cartesian products. For (cl), the proof is 

cycle(id) cyQ|e(A o ({ }o x id)) cycle(A) o { }o id o {lo- - {}o- 


Lemma 2.2 Under the assumption ofDef. I2.il the following is an UnCAL theorem. 
(sub) to(5i , ■ , 5i:,idy') = t [jf 1 -^ 


3 Algebraic Semantics of UnCAL 

In this section, we consider algebraic semantics of UnCAL. We also give a complete characterisation of 
the structural recursion, where e can depend on t in (★). 


3.1 Iteration 2-Algebras 

We first review the notion of iteration X-algebras and various characterisation results by Bloom and Esik. 
Let X be a signature, i.e. a set of function symbols equipped with arities. We define ju-ferms by 

t X \ fiti,...,tn) I PX.t, 

where v is a variable. We use fhe convention fhaf a funcfion symbol e X denotes n-ary. For a sef V 
of variables, we denofe by T(F) fhe sef of all yu-ferms generafed by V. We define ConwayCI as fhe sef of 
following equafional axioms: 


Conway equations px.t[slx] 

px.py.t 

Group equations associated with a group G 

px. (t[l ■ x/x],... ,t[n- x/x])i 


t[px. x[f/x] /x], 

px. t[x/y] 

py. {x\y/x],...,\y/x]) 


Note fhaf the fixed point law 


px.t - t[px.tlx] 


is an insfance of fhe firsf axiom of Conway equations by faking s = x. The group equafions lITTIl known as 
an alternative form of fhe commutative identities, are an axiom schema parameterised by a finite group 
(G, •) of order n, whose elements are natural numbers from 1 to n. We also note that the /i-notation is 
here extended on vectors (ti,...,t„), and (-)i denotes the first component of a vector. Given a vector 
x = {x\,...,Xn)oI distinct variables, the notation i-x = (x,.i,...,x;.„) is used. 


M. Hamana 


83 


Definition 3.1 ([4 |) A pre-iteration 1,-algebra (A,(| - D^) consists of an nonempty set A and an interpre¬ 
tation function d - : T(y) x A^ ^ A satisfying 

(i) d Da ^ Pix) for each xeV (iii) d t Da ^ d Da ^ Da ^ (ipx. t' Da- 

(ii) d t[ti jxi,---, tnlxn] D^ = d f Da with p'{xi) = d b D^, p'{x) = p{x) for a + Xi 

A pre-iteration Z-algebra can be seen as a Z-algebra (A, {/a | / £ S)) with extra operations d/^x. jDa 
for all t. A pre-iteration Z-algebra A satisfies an equation s - t over //-terms, if d'^DA = d^DA- Let E 
be a set of equations over //-terms. An iteration 1-algebra is a pre-iteration E-algebra that satisfies all 
equations in ConwayCI. An iteration {l,E)-algebra is an iteration E-algebra that satisfies all equations 
in E. A homomorphism of iteration E-algebras // : A —> B is a function such that //odtDA - d^D°^^ 

for all t. Since the variety of iteration E-algebras is exactly the variety of all continuous E-algebras ([4] 

Introduction), the interpretation of px. t in an iteration E-algebra can be determined through it. 

We now regard each label f £ L as an unary function symbol. Then we consider an iteration L U 
-t-'-^^l-algebra. We define the axiom set AxBR by 

s + {t + u) = {s + t)-\-u s-\-t =t-\-s t-\-0-t 

px. X =0 px. (x -I- y) = y for y not containing x 

and AxCBR = ConwayCI U AxBR. We write AxCBR l-^ 5 = t if an equation 5 = t is derivable from 

AxCBR by the standard equational logic EL-// for //-terms. For example, idempotency is derivable: 

AxCBR t 1 — t 

The proof is f = px.{x -\-t) = {px.{x -\-t))-\-t = t-\-t, which uses the last axiom in AxBR and the fixed point 
law. Since //-terms can be regarded as a representation of process terms of regular behavior as Milner 
shown in |[23]| (or synchronization trees |21), the standard notion of strong bisimulation between two 
//-terms can be defined. We write 5 ~ t if they are hi similar. 

Theorem 3.2 (Sllllllll) 

(i) The axiom set AxCBR completely axioniatises the bisiniulation, i.e., AxCBR s - t s ~ t 

(ii) The set T(y) of all p-terms forms a free pre-iteration 1-algebra over V. 

(iii) The set EPR of all regular L-labeled trees having V-leaves modulo bisimulation forms a free itera¬ 
tion (L U {0, -\-},AxBR)-algebra over V ( l(T2\l below Lemma 2, l\24\l Thm. 2). 

Note that SK stands for Regular trees modulo Bisimulation, and AxBR stands for the axioms for regular 
trees modulo bisimulation. 

3.2 Characterising UnCAL Normal Form s 

UnCAL normal forms. Given an UnCAL term t of type &, we compute the normal form of t by the 
following three rewrite rules (N.B. we do not here use the other axioms) as a rewrite system (T], which 
are oriented equational axioms taken from the derived theory, AxGr and abbreviations. 

(sub) , ■■•, 5^,id) = t[yi-^5^ 

(Bekic) cycle((t, 5 )) = (/r2,cycle(5')) o (idA,cycle(to (idAxy,cycle(5)))) 

(union) A<>{s,t) = {5}u{t} 
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Let A4 be the set of all rewriting normal forms by the above rules, which finally erases all (-, -) and o in 
a given t. Normal forms are uniquely determined because the rewrite rules are confluent and terminating, 
hence have the unique normal form property |5j- Then by induction on terms we have that terms in M 
follow the grammar 

M3t::= y | f:t | cycle^(0 | {} | {i'}u{t} | (x < t). 

Any outermost definition must be of the form (& < t') by the assumption that the original given t is of 
type &, thus we identity it with t'. Other definitions appear inside of t, as the following cases: 

• Case {{xi < fi)} u {(x 2 < 12 )]- We identify it with merely {ti} u { 12 }, because marker names xi,X 2 
are hidden by this construction. 

• Case Y \- cycle^(.r < t') : x. We identify it with merely cycle®'(T). because these are equivalent by 
renaming of free maker x. 

The UnCAL normal forms N are obtained from Af by these identifications. It is of the form 

N B t ::= y \ (:t \ cycle^(0 | {} | {i'}u{t} 

T(L) 9 t ::= y \ £{t) \ px\ _ pxn-t \ 0 | s + t 

Every normal form bijectively corresponds to a //-term in T(E), i.e. N = T(E), because each the above 
construct corresponds to the lower one, where X - {{x\ ,..., x„)). Hereafter, we may identify normal forms 
and //-terms as above. Define the pair of signature and axioms by 

UnC = (LU{0,-t}, AxBR). 

We regard an arbitrary UnC-algebra J{ as an algebraic model of UnCAL graphs. First, we show the 
existence of a free model. Define AfcBR to be the quotient of N by the congruence generated by AxCBR. 

Proposition 3.3 

Ncbr forms a free iteration UnC-algebra over V. Thus for any function 
if \V ^ IR, there exists an unique UnC-algebra homomorphism if^ such that 
the right diagram commutes, where q is an embedding of variables. 

Proposition 3.4 Ncbr = IBK. 

Proof By Theorem l3.2l (iii). □ 

3.3 Completeness of the Axioms for Bisimulation 

Buneman et al. formulated that UnCAL graphs were identified by extended bisimulation, which is a 
bisimulation on graphs involving e-edges. As discussed in ^ since our approach is to use only UnCAL 
terms, it suffices to consider only the standard (strong) bisimulation between UnCAL terms, as done in 
ll2^ [3l [T^fT3l . We denote by ~ bisimulation for UnCAL term. 

In this subsection, we show the completeness of AxGr for bisimulation, using the following Lemma 
I3.5l that reduces the problem of EL-UnCAL to that of EL-// through UnCAL normal forms. AxCBR has 
been shown to be complete for the bisimulation Si. 

Lemma 3.5 For UnCAL normal forms n,me N, AxCBR \-^n = m Y \-n = m : X is derivable from 
AxGr in EL-UnCAL. 
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Proof. [=>] : By induction on proofs of EL-/i. For every axiom in AxCBR, there exists the corresponding 
axiom in AxGr or an EL-UnCAL theorem, hence it can be emulated. 

[<=]: By induction on proofs of EL-UnCAL. Let 5 = f is an axiom of EL-UnCAL. It easy to see that taking 
normal forms of both side, they are equal term, or correspond to an axiom in AxCBR or EL-/r theorem. 

□ 


Theorem 3.6 (Completeness) AxGr is sound and complete for the bisimulation, i.e., 

Y s = t : X is derivable from AxGr in EL-UnCAL iff s ~ t. 

Proof [=>] : Because every axiom in AxGr is bisimilar, and the bisimulation is closed under contexts 
and substitutions | 8 ]. 

[<=] : Suppose s ~t. Since for each rewrite rule for the normalisation function nf, both sides of the rule is 
bisimilar, nf preserves the bisimilarity. So we have s ~ nf(5) ~ nf(t) ~ t. Since AxCBR is complete axioms 
of bisimulation Il3l[l2l, AxCBR nf(5) = nf(t). By Lemma IT5l we have a theorem Y \- nf(5) = nf(f) : X. 
Thus s = t is derivable. □ 


3.4 Interpretation in Algebraic Models 


n 


V 


nf 


T(y) = U 


To interpret UnCAL terms and equations, we connect two freeness results in Thm. 

Since UnCAL normal forms Af is isomorphic to a free pre-iteration algebra TfU), it has the 
universal property. Define 7& to be the set of all well-typed UnCAL terms of type &. 
We define nf : T& —> Af by the function to compute the UnCAL normal form 
of a term. Then for any derivable equation T 1 - 5 = f : A in EL-UnCAL, we 
have AxCBR 1 - nf(s) = nf(t) by Lemma [T5l thus for all assignment f : V 

nf(5) D'' = nf(0 D'' 

where rj and 77 ' are embedding of variables. 

Since AfcBR = we name the isomorphisms (-) : A/cbr ^ SK and 
(-) : SH AfcBR- We write simply a normal form t to denote a represen¬ 
tative [t] in AfcBR- Thus given a normal form t (which is a syntactic term, 
always finite), f is a (possibly infinite) regular tree by obtained by expand¬ 
ing cycles in t using fixpoints. Conversely, notice that since f is a tree, there are no cycles and the original 
cycles in t are infinitely expanded. Since N = T(y), the functions (-) may also be applied to /i-terms. 
The iteration UnC-algebra SP. has operations 0 s« = {}, -i-s?^(r, 5) = {r) u {5), isnff) = ((f)- 



3.5 Deriving structural recursion of involved case 

Next we model UnCAL’s structural recursion of graphs. We use pairs of “the recursive computation” and 
the history of data structure. This is similar to the technique of paramorphism which is a way to 
represent primitive recursion in terms of “fold” in functional programming. Our universal characterisa¬ 
tion of graphs is the key to make this possible by the unique homomorphism from the free pre-iteration 
UnC-algebra N using the above analysis. 

We take a term X h ei{v,r) : X involving metavariables v and r, where e({F{t),t) is the right-hand 
side e of F((: t) in (★) . For example, in case of the example f 1 in Introduction (see also Example 13.91) . 
we take 

e{(v,r) = result: r, e({F{t),t) = result: t i(( = ethnicGroup 

e({v,r) = V, e({F{t),t) = F{t) i(( ethnicGroup 
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We construct a specific iteration UnC-algebra BUe for Let k = |X|. Without loss of 

generality, we can assume that e{{v,r) is of the form (ti, ■■■, 4) where every t,- is a normal form. We 
define the iteration UnC-algehra SKe - x SK having operation 

^SHeiv,r) ^ { ec{v,r) , fir ), OsKe = (U>U) 


and +s'Re is an obvious tuple extensions of +Bii- Here {) is the ^-tuple 
use this convention o of tuple extension of an operator o. 

Then, two freeness results in Thm. I3.2I are depicted in the right diagram, 
where p{x) - - ,Xk,x). Since T(L) = N, the interpretation in SHe is 

described as 

d ^ d { } d ^ {f} d d 1 \}S‘Re 

d t t d cycle(0 = ? 7 ^( cycle( 0 ) 

Now d - is characterised as the unique pre-iteration LU {0,-i-}-algebra 
homomorphism from T(L) that extends p. Defining 

if is fhe unique funcfion satisfying 


of {}. Hereaffer, we will 
77" 

V -U T(l/) = U 



^yv^BR 


0(x) ^ X^), (;6({))^n, 0(1.9} U {?}) = (^(s) U 0(0 , 

<p(f: t) = efivj), 0(cycle(f)) = tti o ? 7 *(cycle( 0 ) 


The funcfion 0 fakes normal forms of fhe type &. For non-normal forms, jusf precompose nf, i.e., define 
fhe funcfion <1) : 7& ^ -^cbr - 0(nf(9)), fhus, <1)1^1 '-Tx^ yVcBR^'^' ^ because Tx = In 

summary, we have fhe following, where 9 is a possibly non-normal form 

0 ( 9 ) =0(nf(9)) 0(x) ={x\,---,Xk) 0({})^{} 

(Dl^l+TI((fi , f 2 » = $1^1(4) X (l)l^l(f 2 ) 0(^:0 =e({(p{t),t) 0(4 ut2) ^ 0(4) u 0(t2) (2) 

‘5”(()) =0 0(cycle(t)) = TTi o? 7 #(cycle(f)) 


where x is fhe “zip” operator of fwo fuples. Here we use a map yVcBR ^ Tm(F) to regard a normal form 
modulo AxCBR as a ferm, for which any choise of represenfafive is harmless, because UnCAL graphs 
are idenfified by bisimulafion and AxCBR axiomafises if. Identifying fhree kinds funcfions <l),Ol^l ,0 as 
a single funcfion (also denofed by O, by abuse of nofion) on Tm(F), fhis O is essentially whaf Buneman 
ef al. lUl called fhe sfrucfural recursion on graphs for fhe case fhaf e depends on t. Acfually, we could 
make fhe characferisafion more precise fhan f8|, i.e., we obfain also fhe laws for fhe cases of o (by fhe 
case 0(9) = 0 (nf( 9 ))) and cycle, which fells how to compute fhem. 

This is nol merely rephrasing fhe known resulf, buf also a sfronger characferisafion, which gives 
precise undersfanding of fhe sfrucfural recursion on graphs: 

(i) Buneman el al. slated lhal ([Til wilhouf (x) is a property ([8] Prop. 3) of a “sfrucfural recursive 
funcfion on graphs” defined by fhe algorilhms in [8.]. This property (i.e. soundness) is desirable, 
buf unforfunafely, no completeness was given. There may be many functions lhal salisfy fhe prop¬ 
erly. In conlrasl to if, our characferisafion is sound and complete'. (2) determines a unique function 
by fhe universalily. 
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(ii) This derivation does not entail 0(5of) - 0(5)oO(f). It tells us that the only way to compute 0(5of) 
is to compute the normal form of 5 o f and then apply 0. 

(iii) This analysis does not entail 0(cycle(f)) = cycle(0(f)) either. The iteration algebra structure tells 
us that the homomorphism (p maps a term cycle(f) to its interpretation in &Re where the cycles 
are expanded in a regular tree and at the same time, labels I are interpreted using the operations of 
SKe. 

(iv) The structure preserved by structural recursion is the (pre-)iteration algebra structure. The struc¬ 
tural recursive function <p is the composition of a pre-iteration algebra homomorphism, an a itera¬ 
tion algebra homomorphism and a projection. 

3.6 Examples 

We may use the notation {fi,f 2 ,...) as the abbreviation of {fi} u {f 2 } u • • •. 

Example 3.7 ([iBl Replace all labels with a) This is the example considered in Introduction, 
sfun f2(L:T) = a:f2(T) 

In this case, the recursion does not depend on T (because the right-hand side uses merely f2(T)). We 

define the iteration UnC-algebra S'Re by 


is'Re{v,r) = (a:v, t.r). 

(We may omit over and underlines to denote the isomorphisms for simplicity). Then <1) is the desired 
structural recursive function f2. E.g. 

0 (b:cycle(c:&)) = a:(/>(cycle(c:&)) = airri o77**(c:c: •■■) = a:(a:a: •••) = a:cycle(a:&) 

Example 3.8 (181 Double the children of each node) 

sfun f4CL:T) - {a:f4CT)) u {b;f4CT)} 

Example of execution. 

£4(a:b:c:{D 

{a:{ a:{a:{), b:{)}, b:(a:{}, b:{}) }) u {b:{ a:(a:{}, b:{}), b:{a:{), b:{)} )} 

This case does not depend on T. We define fhe iferafion UnC-algebra IBUe by 

r) = ({a: v} u {b: v}, r). 

Then d) gives fhe sfrucfural recursive function defined by f4. 

Example 3.9 (fSl Retrieve all ethnic groups) We revisit the example given in ® 

Eor the structural recursive recursive definition of f 1, 

sfun fl(L:T) = if L = ethnicGroup then (result:!) else flfl) 

This case does depend on T. Example of execution: 

fl(sd) ^ {result:"Celtic":0, result:"Portuguese":!}, result:"Italian":{)} 
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We define the iteration UnC-algebra Slie by 

ethnicGroupg^g(v, r) = (result: r, ethnicGroup: r) 
fsKe{v,r) = (v, f: r) for € t ethnicGroup 

Then <1) is the structural recursive function defined by f 1: 

O (sd) = {result:"Celtic":{), result:"Portuguese":{}, result:"Italian":{}} 

Example 3.10 Consider another example in SJof aa?. This case does depend on T. We define the 
iteration UnC-algebra by 

as^e(v, r) ^ (a?(r), a: r) 
f&Re{v, r) ^ (v, €: r) for f a. 

Then d) gives the structural function aa? 

OCCa:&)(aCa:{})) = 0( nf((a:&)@(a:{})) ) ^ ^Ca:a:{)) ^ true:{| 

0 (cycle(a:&)) = tti o77^(cyclGCa:&)) = n\ o? 7 *^(a: a: = (a?(a: •••),&:•••) = true : {} 


4 Conclusion 

In this paper, we have shown an application of Bloom and Esik’s iteration algebras to model graph data 
used in UnQL/UnCAL for describing and manipulating graphs. We have formulated UnCAL and given 
an axiomatisation of UnCAL graphs that characterises the original bisimulation. We have given algebraic 
semantics using Bloom and Esik’s iteration iteration algebras. The main result of this paper was to show 
that completeness of our equational axioms for UnCAL for the original bisimulation of UnCAL graphs 
via iteration algebras. As a consequence, we have given a clean characterisation of the computation 
mechanism of UnCAL, called “structural recursion on graphs” using free iteration algebra. 

Acknowledgments. I am grateful to Kazutaka Matsuda and Kazuyuki Asada for discussions about 
UnCAL and its interpretation, and their helpful comments on a draft of the paper. A part of this work 
was done while I was visiting National Institute of Informatics (Nil) during 2013 - 2014. 
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